Patent Map Generation Using Concept-Based Vector Space Model
Authors
Abstract
This paper proposes a patent map generation system based on a concept-based vector space model and presents evaluation results from the NTCIR-4 patent feasibility study (FS) task. The concept base is a knowledge base of words in which each word is expressed as an associated vector. The word vectors are computed from word co-occurrence in a target document set, so they reflect the characteristics of the target documents. Each document in the target document set is expressed as a vector composed from the vectors of the words it contains. Word vectors and document vectors lie in the same vector space, so the degree of relevance between any two words and/or documents can be computed as the cosine coefficient of their vectors. Taking advantage of this model, the problem sections and solution sections of patent documents are expressed as vectors and then clustered, and the label words for each cluster are chosen from the words that give a high cosine coefficient with the cluster's center of gravity. The system was used in a trial to generate patent maps for the NTCIR-4 patent FS task topics. Compared with human-generated patent maps, the system clusters the target patents with fairly good accuracy, but its cluster labeling accuracy is poor.
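The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the toy documents, the document-level co-occurrence counting, the unweighted summation of word vectors, and the helper names (`build_word_vectors`, `document_vector`, `cosine`, `label_cluster`) are all assumptions made for illustration, and the cluster membership is supplied by hand because the abstract does not name a clustering algorithm.

```python
import math
from collections import Counter, defaultdict

def build_word_vectors(docs):
    """Map each word to a sparse vector counting the words it shares a document with.

    Assumption: co-occurrence is counted once per document, and a word also
    counts as co-occurring with itself.
    """
    vectors = defaultdict(Counter)
    for tokens in docs:
        unique = set(tokens)
        for word in unique:
            for other in unique:
                vectors[word][other] += 1
    return vectors

def add_vectors(vectors):
    """Sum an iterable of sparse vectors."""
    total = Counter()
    for vec in vectors:
        total.update(vec)
    return total

def document_vector(tokens, word_vectors):
    """Compose a document vector from the vectors of its words (plain sum, assumed)."""
    return add_vectors(word_vectors[t] for t in tokens if t in word_vectors)

def cosine(u, v):
    """Cosine coefficient of two sparse vectors; 0.0 if either is empty."""
    dot = sum(u[k] * v[k] for k in u if k in v)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)

def label_cluster(cluster_doc_vectors, word_vectors, top_n=2):
    """Pick the words whose vectors are closest to the cluster's center of gravity."""
    # The sum is proportional to the mean, and cosine ignores scale.
    centroid = add_vectors(cluster_doc_vectors)
    ranked = sorted(word_vectors,
                    key=lambda w: cosine(word_vectors[w], centroid),
                    reverse=True)
    return ranked[:top_n]

# Hypothetical tokenized "problem sections" of four patents.
docs = [
    ["battery", "overheat", "cooling", "fan"],
    ["battery", "overheat", "heat", "sink"],
    ["display", "flicker", "refresh", "rate"],
    ["display", "flicker", "backlight", "driver"],
]

word_vectors = build_word_vectors(docs)
doc_vectors = [document_vector(d, word_vectors) for d in docs]

# Words and documents live in the same space, so any pair can be compared.
sim_same_topic = cosine(doc_vectors[0], doc_vectors[1])
sim_other_topic = cosine(doc_vectors[0], doc_vectors[2])

# Cluster membership is given by hand here; a real system would cluster first.
labels = label_cluster(doc_vectors[:2], word_vectors)
print(sim_same_topic, sim_other_topic, labels)
```

Because word and document vectors share one space, the same `cosine` function serves both document-to-document comparison and label selection. In a fuller version, the hand-picked cluster would be replaced by the output of a clustering step run on `doc_vectors`.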
Similar resources
An Automated Research Paper Classification Method for the IPC system with the Concept Base
In the present paper, a classification method using the Concept Base is proposed and evaluated in the Patent Mining Task of the NTCIR-7 workshop. In this task, research papers are classified into the International Patent Classification (IPC) system. The classification enables research papers to be located on a patent map. In order to classify a paper, patent documents that are similar to the pa...
Prior Art Search using International Patent Classification Codes and All-Claims-Queries
In this study, we describe our system at the Intellectual Property track of the 2009 Cross-Language Evaluation Forum campaign (CLEF-IP). The CLEF-IP track addressed prior art search for patent applications. We used the Apache Lucene IR library to conduct experiments with the traditional TF-IDF-based ranking approach, indexing both the textual content of each patent and the IPC codes assigned to ...
Investigating the Effect of Functionality Level of Analogical Stimulation on Design Outcomes
Design-by-analogy is a growing field of study and practice, due to its power to augment traditional concept generation methods by expanding the set of generated ideas using similarity relationships from solutions to analogous problems. A new method for extracting functional analogies from data sources has been developed to assist designers in systematically seeking and identifying analogies fro...
Geometric analysis of concept vectors based on similarity values
In this paper, we offer a geometric framework for the computing of a concept’s conceptual vector based on its similarity position with other concepts in a vector space called concept space, which is a set of concept vectors together with a distance function derived from a similarity model. We show that there exists an isometry to map a concept space to a Euclidean space. So, the concept vector ...
A Note on Quadratic Maps for Hilbert Space Operators
In this paper, we introduce the notion of a sesquilinear map on B(H). Based on this notion, we define the quadratic map, which is a generalization of the positive linear map. With the help of this concept, we prove several well-known equalities and inequalities...
Publication date: 2004